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This report summarizes the proceedings of the 14 th workshop of the Genomic Standards Con- 
sortium (GSC) held at the University of Oxford in September 2012. The primary goal of the 
workshop was to work towards the launch of the Genomic Observatories (GOs) Network un- 
der the GSC. For the first time, it brought together potential GOs sites, GSC members, and a 
range of interested partner organizations. It thus represented the first meeting of the GOs 
Network (GOsl). Key outcomes include the formation of a core group of "champions" ready 
to take the GOs Network forward, as well as the formation of working groups. The workshop 
also served as the first meeting of a wide range of participants in the Ocean Sampling Day 
(OSD) initiative, a first GOs action. Three projects with complementary interests - COST Ac- 
tion ES1 103, MG4U and Micro B3 - organized joint sessions at the workshop. A two-day 
GSC Hackathon followed the main three days of meetings. 
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Introduction 

The Genomic Standards Consortium (GSC) formed 
in 2005 to tackle the challenge of working as a 
community towards improving the quantity and 
quality of contextual data made accessible for ge- 
nomes and metagenomes [1]. The GSC works to- 
wards its goal through the creation, maintenance, 
and adoption of a range of standards and collabo- 
rative projects. Further information about the GSC 
and its range of activities can be found on the GSC 
wiki pages [2]. Here we describe the proceedings 
and outcomes of the GSC's 14th meeting (GSC14) 
at Oxford University, 17-19 September 2012. 

The main theme of GSC 14 was "genomics enabled 
long-term, place-based research". A core mission 
of the GSC is to improve the contextualization of 
genomic data, and perhaps the best chance to un- 
dertake highly contextualized genomic research 
over the long-term is to leverage places that are 
already the focus of sustained ecological, envi- 
ronmental, and social research. Discussion of a 
GSC focus on "place-based" genomic research be- 
gan at the GSC 11 meeting in Cambridge, UK in 
April 2011 and continued at the GSC 12 meeting in 
Bremen, Germany in September that same year. 
These discussions led to a call for an international 
network of "Genomic Observatories" from scien- 
tists associated with various intensively studied 
sites around the world [3]. The GOs concept was 
further developed at the GSC's 13th meeting in 
Shenzhen, China in March 2012 [4]. GSC 14 repre- 
sented the first formal meeting of the emerging 
GOs Network, bringing together the GSC commu- 
nity, potential GOs sites, and GOs partner organi- 
zations to identify shared interests and consider 
strategies for the future. 

The meeting was structured into six sessions that 
aimed to define the GOs Network mission, outline 
some initial actions, and explore options for its or- 
ganizational structure. Workshop objectives in- 
cluded to (i) present candidate GOs sites and GOs 
partner organizations, (ii) discuss GOs actions, 
(iii) develop interactions between GOs and the 
broader GSC community, and (iv) give the partici- 
pants time to form working groups and deliberate 
priorities. The first session that introduced the 
work of the GSC was co-organized and supported 
by the European Cooperation in Science and 
Technology (COST) Action (ES1103) [5]. Follow- 
ing the main meeting, two satellite meetings were 
organized and supported by EU FP7 projects. The 
first, Marine Genomics for Users (MG4U) [6] was a 



stakeholders session considering how to translate 
advances in genomic research to ecosystem-based 
marine management actions and policy perspec- 
tives. The second satellite was a planning meeting 
of the Marine Biodiversity, Bioinformatics, Bio- 
technology (Micro B3) project [7] that leads Ocean 
Sampling Day, one of the first GOs actions. A two- 
day GSC hackathon also followed the meeting, 
which focused on issues related to the GOs infor- 
matics stack. The workshop presentations as well 
as interviews with some of the attendees were 
recorded by Mediomix ( http : / / mediomix.de / ) and 
are available on YouTube (http://www.you- 
tube.com/playlist?list=PLgacjRIHqvMD6HleJaD0i 
HzyoANpc2L41). 

The 15th Genomic Standards Consortium (GSC 15) 
meeting will be held at the National Institute of 
Health (NIH) in Bethesda, Maryland, 22-24 April 
2013. This meeting will highlight the utilization of 
genome metadata standards for the enhancement 
of our biological understanding of microbes, the 
interaction between microbial genomes, human 
health and disease. It will include a GOs Network 
session (considering the potential of citizen sci- 
ence genomics and humans as GOs) and a GOs sat- 
ellite meeting at the Smithsonian Institution (fo- 
cused on marine genomic observing systems and 
Ocean Sampling Day). Please see the GSC 15 home 
page [8] for further details. 

Session I: GSC Standards - 
Introduction and updates 

Chaired by Peter Sterk (University of Oxford, UK), 
the opening session served to familiarize new- 
comers with the GSC and the range of its activities. 
It was hosted as a joint session with the Microbial 
Ecology & the Earth System [5] consortium, a Eu- 
ropean Cooperation in Science and Technology 
(COST) action. The session introduced new partic- 
ipants to the GSC and explored how to work with 
the GSC standards. 

Dawn Field (NERC Centre for Ecology and Hydrol- 
ogy Wallingford, UK; University of Oxford, UK) in- 
troduced "An Open Call to work under the umbrel- 
la of the GSC" describing the background of the 
GSC and announcing its new policy for initiating 
projects. The GSC is an open membership commu- 
nity working towards better-standardized de- 
scriptions of genomes, metagenomes and marker 
gene sets [1]. Standards are key to accelerating 
science, avoiding redundancy and enabling inte- 
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gration of data streams. The GSC runs a range of 
consensus-driven projects and is calling for com- 
munity involvement and encouraging broader 
compliance. The GSC welcomes the involvement of 
the GOs Network as a GSC Project/Working Group 
and looks forward to future interactions. Among 
other things, the GOs Network will provide use- 
cases for further developing GSC standards and 
showcases for the scientific opportunities that flow 
from their application. 

Renzo Kottmann (Max Planck Institute Bremen, 
Germany) then described the background and 
mission of the "Microbial Ecology & the Earth Sys- 
tem" COST action. He introduced the microbial 
ecology workflow from field sampling to analysis 
and publication and the need for standards in this 
domain. He explained how the GSC's Minimum In- 
formation about any Sequence (MIxS) standard 
specifically covers the "xyzt" metadata (latitude, 
longitude, depth and time] that are fundamental 
to field biology. Pelin Yilmaz (Max Planck Institute 
Bremen, Germany), the lead author on the GSC's 
MIxS publication, then gave a detailed description 
of the development of the MIxS standard and its 
use today [9]. 

Seven short talks covered aspects of the emerging 
"Microbial Ecology Data Pipelines". The first 
presentation was by Steven Stones-Havas 
(Biomatters Ltd., New Zealand) describing work to 
connect Field Information Management Systems 
(FIMS) to Laboratory Information Management 
Systems (LIMS) and specifically the FIMS/LIMS 
developed for the Moorea Biocode Project [10]. 
One product of this work is the open-source 
Biocode LIMS - a free plugin for DNA barcoding on 
the Geneious software platform [11]. The next 
talks focused on analysis of data and included an 
update on the newly launched metagenomics por- 
tal (MG Portal) [12] at the European Bioinformat- 
ics Institute (Sarah Hunter, European Bioinfor- 
matics Institute, UK), an update on CAMERA [13] 
(Jeff Grethe, University of California San Diego, 
USA), and presentation of the BIOM standard as a 
new addition to QIIME [14] (Jai Ram Rideout, Uni- 
versity of Colorado, Boulder, USA). Anna 
Klindworth (Max Planck Institute Bremen, Ger- 
many) then talked on the evaluation of primer and 
primer pairs for 16S ribosomal RNA biodiversity 
studies. Finally, Guy Cochrane (European Bioin- 
formatics Institute, UK) outlined submission of 
MIxS-compliant data to the International Nucleo- 
tide Sequence Database Collaboration (INSDC) 

Table 1: Candidate Sites with their GSC 14 represen 



[15] and provided some insight into the challeng- 
es for those designing and operating submission 
services. George Garrity (Michigan State Universi- 
ty, USA) finished the session with an update on the 
GSC's open access journal "Standards in Genomic 
Sciences" (SIGS) [16]. Overall, the presentations 
highlighted the importance of consistent genera- 
tion and flow of MIxS compliant data through Mi- 
crobial Ecology Data Pipelines from field sampling 
to analysis and publication. The MIxS standard 
adds value to the sequence data and better en- 
sures comparability of different sequence data 
sets. 

GOs Sites and GO Network Partners 
(Sessions II & IV: Lightning talks) 

GSC 14 introduced two sets of key participants in- 
terested in joining or collaborating with the GOs 
Network: (a) representatives of intensively stud- 
ied sites (Session II, Table 1) and (b) organiza- 
tions, networks, and initiatives interested in part- 
nering with GOs (Session IV, Table 2). The absence 
of some sites already engaged in the discussions of 
the GOs Network was noted, and it was also 
acknowledged that other very worthy sites had 
not yet been approached. All agreed that a broad- 
er geographic representation of GOs would be de- 
sirable in the future. Likewise, although a wide 
range of potential partners was represented, some 
were absent and others that should be involved in 
the founding of the network have not yet been 
contacted. Dawn Field (NERC Centre for Ecology 
and Hydrology Wallingford, UK; University of Ox- 
ford, UK) chaired the session and gave a brief 
overview of the GOs Network since the initiation 
of the concept following GSC 11 (April 2011) 
when she and Neil Davies began discussing the 
benefits of place-based genomic research at the 
"L4" and "Moorea" sites [4]. Subsequent discus- 
sions with an ever growing number of researchers 
pioneering DNA-based approaches at premier re- 
search sites, led to a letter published in early 2012 
by 68 authors (many present at GSC 14) calling for 
the creation of a Genomic Observatories Network 
[3]. GSC 14 offered the first chance for this group 
to meet and decide the future of the GOs Network. 

The candidate sites (potential Genomic Observa- 
tories) all represented long-term research sites 
with decades to a century of data thus offering 
unique opportunities to integrate genomic re- 
search into a range of other research activities. 

tatives/authors 
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Candidate Site Representative 



Asia-Pacific 




Moorea, French Polynesia 


Serge Planes (CNRS-EPHE CRIOBE) and Neil 




Davies* (UC Berkeley) 


Hauturu, New Zealand 


Alexei Drummond (Auckland) 


Europe 




Rothamsted, UK 


Penny Hirsch (Rothamsted) 


Plymouth L4, UK 


Tim Smyth (Plymouth) 


Roscoff, France 


Nathalie Simon (UPMC-CNRS SBR) 


Helgoland, Germany 


Gunnar Gerdts (Alfred Wegener Institut) 


Naples MareChiara LTER, Italy 


Adriana Zingone (SZN) 


Crete, Greece 


Georgios Kotoulas (HCMR) 


Banyuls, France 


Ian Salters (Banyuls) 


Polar 




Ny-Alesund Svalbard Arctic & Rothera Antarctica Melody Clark (BAS) 


Americas 




Northern Temperate Lakes LTER, USA 


Trina McMahon (University of Wisconsin) 


Western Atlantic, Smithsonian Marine Science 


Chris Meyer (Smithsonian) 


Network 


Southern California Coastal Waters, USA 


Steve Weisberg (SCCWRP) 


* Presenter 




Table 2. Candidate partners and their representatives at GSC 14 


Candidate partners 


Representative 


Networks of Sites 




European Network of Marine Research Insti- 


Michael Thorndyke (SLC Sweden); presentation 


tutes and Stations (MARS) [1 7] 


given by Matthias Obst 


U.S. Western Association of Marine Laborato- 


Steve Weisberg (Southern California Water Re- 


ries (WAML) [18] 


search Program) 


Smithsonian Global Earth Observatory (SI 


Eldridge Bermingham (Smithsonian) 


GEO) [19] 




Global Lake Ecological Observatory Network 


Trina McMahon (University of Wisconsin) 


(GLEON) [20] 




Infrastructures 




European Marine Biological Resource Center 


Colin Brownlee (MBA) 


(EMBRC)[21[ 




Global Genome Initiative (GGI) [22] 


Jonathan Coddington (Smithsonian) 


Barcode of Life (IBOL, BOLD) [23] 


Sujeevan Ratnasingham (Guelph) 


Group on Earth Observations Biodiversity Ob- 


Dan Faith (Australian Museum) 


serving Network (GEO BON) [24] 




Institutes, Initiatives and Programs 




Berkeley Initiative Global Change Biology 


Michelle Koo (Berkeley) 


(BIGCB) [25] 




International Soil Metagenome Sequencing 


Folker Meyer (Argon ne) 


Consortium & RCN (Terragenome) [26] 




Biodiversity Virtual e-Laboratory (BioVeL) [27] 


Matthias Obst (Gothenburg) 


Earth Microbiome Project (EMP) [28] 


Jack A. Gilbert (Argon ne/Chicago) 
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The range of candidate partners included net- 
works of sites, institutions (field stations, marine 
labs, universities, research institutes), and organi- 
zations/programs that contribute to and benefit 
from place-based research in genomics [e.g., by 
providing essential contextual data and/or re- 
search infrastructure). Each site and partner was 
invited to give an overview of its history, research 
and future plans to work as a genomic observato- 
ry. Since talks were only 5 minutes, they were all 
structured into a common template with sections 
for (i) Genomic Observatories (Site Description, 
Institutional Partners, Facilities, Access to Biodi- 
versity, Data Sets) and (ii) Genomic Observatories 
Partners (History and Structure, Mission, Assets, 
How (meta)data standards are employed, What 
you would gain from and contribute to a GOs Net- 
work). Subsequent to GSC14 all sites and partners 
will be invited to expand their presentation into 
short papers. 

Session III: GOs First Actions 

The last session of Day 1 was the first opportunity 
at this meeting to have an open discussion about 
how best to build the Network. To drive discus- 
sions of what a GOs Network could contribute to 
science, the Ocean Sampling Day (OSD) initiative 
[29], which involved several candidate GOs sites, 
was discussed as a case study. Two brief presenta- 
tions were given to set the stage for the discus- 
sions. First, the OSD initiative was described by 
Frank Oliver Glockner (Jacobs University Bremen 
& Max Planck Institute Bremen, Germany) in the 
context of the EU 7FP Micro B3 (Biodiversity, Bio- 
informatics, Biotechnology) project [7], which is 
leading OSD. Micro B3 will develop innovative 
bioinformatic approaches and a legal framework 
to make large-scale data on marine viral, bacterial, 
archaeal and protist genomes and metagenomes 
accessible for marine ecosystems biology and to 
define new targets for biotechnological applica- 
tions. The project builds upon a highly interdisci- 
plinary consortium of 32 academic and industrial 
partners comprising world-leading experts in bio- 
informatics, computer science, biology, ecology, 
oceanography, bioprospecting and biotechnology, 
as well as legal aspects. Ocean Sampling Day is a 
global, simultaneous megasequencing campaign 
that will take place on the June solstice 2014. OSD 
is designed to collect standardized data suitable 
for modeling the distributions, abundances and 
functions of microbes in the world's oceans. A first 



pilot run, which leveraged the emerging GOs Net- 
work, took place on the solstice in June 2012. 

The second talk presented crowd funding as a new 
way to support aspects of such international pro- 
jects. Wolfgang Hankeln (Mediomix, Germany) 
made a short pitch for the OSD community to let 
his company create a movie about OSD funded by 
public contributions to the project. 

The chair of the session, Chris Meyer (Smithson- 
ian, USA) then invited all of the OSD participants 
at the meeting to join an open panel discussion. 
Participants included Georgios Kotoulas (Crete), 
Burak Ali Cicek (Cyprus), Gunnar Gerdts (Helgo- 
land), Neil Davies (Moorea), Adriana Zingone (Na- 
ples), Nathalie Simon (Roscoff), Melody Clark 
(Rothera), Viggo E>6r Marteinsson (Iceland), Tim 
Smyth (Plymouth L4). 

Each participant in the pilot 2012 OSD was asked 
to self-sort their participation along an axis of ease 
in participation from 1 being difficult to 10 being 
very easy. This created a gradient of partners who 
found the sampling and processing easy to do with 
minimal extra effort, at one end, versus more chal- 
lenging and significant additional effort, at the 
other. Most of the pilot sites characterized them- 
selves well toward the "easy side" of the spectrum, 
with a few partners sprinkled down the chain. 
Discussions then focused on the difficulties en- 
countered in order to better coordinate and pre- 
pare for a more extensive effort. OSD is a first ex- 
ample of the way that coordinated research pro- 
jects [e.g., Micro B3) might leverage the GOs Net- 
work. For example, many of the sites participating 
in the OSD pilot were not part of the Micro B3 
grant. They contributed time and resources to 
make the collections because of their interest in 
developing common standards-based genomic da- 
tasets and in sharing their experiences with others 
in a mutual knowledge exchange (hence their par- 
ticipation in the GOs Network. 

It was agreed that the common platform (instru- 
ment) the GOs Network could represent, especial- 
ly when specifically proven and tested by initia- 
tives such as OSD, would enable many other future 
projects involving coordination and comparable 
approaches across regions, and eventually the 
globe. 
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Sessions V-VII: Breakout Groups and 
the formation of GOs Working Groups 

A key aim of this meeting was to establish persis- 
tent mechanisms for building the network, to take 
forward actions, and establish shared best prac- 
tices. This included discussing preferred govern- 
ance models and establishing working groups to 
help build the GOs strategy. The breakout sessions 
were dedicated to the exploration of four topics 
and the governance of the GOs Network. 

To set the scene for the breakout sessions, Robert 
Robbins (University of California San Diego, USA) 
chaired a session with presentations by the leads 
of each breakout group. Talks covered (i) Biodi- 
versity Genomics, (ii) Theory and Modeling, (iii) 
Informatics (Biocode Commons), and (iv) 
Bioarchiving and Access and Benefit Sharing 
(ABS). For each topic, the chair gave a 20-minute 
"charge" talk and then the groups broke out to 
discuss, supported by a panel of experts in the 
topic. Day Three was dedicated to giving time for 
all the breakout groups to report back to the full 
group. 

The Biodiversity Genomics group, chaired by Chris 
Meyer (Smithsonian, USA), was deemed so central 
to the GOs Network that the core group of the GOs 
Network was tasked to develop it further. Topics 
covered in discussion included in particular the 
biology of Dark Taxa and how this presented chal- 
lenges for data acquisition and analysis. Are new 
approaches needed in taxonomy, especially as 
more microbial data are generated where un- 
known taxa (and genes) are routinely found? In 
subsequent cross-discussions between the work- 
ing groups it became clear that developing the sci- 
ence of biodiversity genomics was a general goal 
of the GOs Network that would require input from 
all, but especially the Theory and Informatics 
working groups. 

Following the meeting it was agreed that the GOs 
Network would take forward its activities under 5 
key umbrella areas: Science, Informatics, Tech- 
nologies, Policy, and Governance. The Theory & 
Modeling group would fall under Science, Biocode 
Commons under Informatics, ABS under the 
broader topic of Policy (including data manage- 
ment policies and the social/cultural challenges of 
applying biodiversity genomics to achieve policy 
goals, such as 'good environmental status'). Gov- 
ernance would be formalized into the leadership 
of the GOs Network. The Technologies thematic 
area was added to cover Bioarchiving (handling 



and preserving samples from field collection to ex 
situ repositories) as well as the rapid evolution of 
molecular analytical approaches, such as in situ 
ecogenomic sensor platforms. This structure pro- 
vides ample coverage of topics and will allow the 
GOs Network to grow areas as required. Coordina- 
tion across these main thematic areas between 
groups will be the responsibility of the core group 
of the GOs Network - an informal steering commit- 
tee - until and unless a more formal mechanism is 
agreed. 

SCIENCE: Theory & Modeling 

Alexei Drummond (Auckland, NZ) chaired the ses- 
sion with panel members Alfried Vogler 
(NHM/Imperial), Rampal Etienne (University of 
Groningen), Dan Faith (Australian Museum), Jack 
Gilbert (Argonne National Laboratory), and 
minutes taken by Pier Luigi Buttigieg (MPI Bre- 
men). 

Introduction: Before the meeting, a list of key sci- 
entific questions was identified for discussion by 
the breakout group [1]: How to assess (observe) 
biodiversity patterns at different temporal and 
spatial scales [2]? What are the rules governing 
macro-ecological patterns [3]? What do these 
rules tell us about what will happen next given 
major human-induced perturbations of the Earth's 
climate and ecosystems [4]? How do genomic 
technologies and place-based studies help? 

Group chair Alexei Drummond (Auckland, NZ) 
pointed out in his opening presentation that ecol- 
ogy and genomic technologies have been on a" col- 
lision course" for a number of years and are really 
starting to hit home now. To date, molecular ecol- 
ogy has typically focused on a small number of 
species and their genetic variation (usually from a 
small number of loci) in space and time. Genomic 
Observatories, however, now offer the opportuni- 
ty of studying much larger numbers of well- 
contextualized (e.g., with environmental sensors) 
genes (up to whole genomes), individuals, and 
species within and between ecosystems. Such ca- 
pabilities provide challenges (e.g., how to treat 
such large heterogeneous data streams) as well as 
opportunities to advance ecological theory. 

Goals: Identify the grand challenges in theory and 
modeling relevant to GOs 

Findings and Recommendations: The group ad- 
vised the GOs Network to base its future work on 
the fundamentals of good (scientific) theory, that 
is: (a) Consistent with observed phenomena, (b) 
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Has predictive power, (c) Applicable to a broad 
range of phenomena, (d) Simple / parsimonious. 

The kinds of macro-ecological observations and 
phenomena (data to be understood/ modeled] by 
the GOs Network was found to include: 

•Species abundance distributions in an area 
(SADs) 

• Species-area relationship (SAR) and Phylogenet- 
ic Diversity-area relationship 

•Abundance-occupancy correlations (decay of di- 
versity similarity with distance) 

• Next-generation environmental sequence data 
and associated environmental measurements 
(temperature, humidity, soil chemistry) 

The GOs Network, with its highly characterized 
sites, could make important contributions to ma- 
jor questions such as [1]: understanding the na- 
ture and significance of the long tail of rare taxa in 
ecosystems (at the species and the genetic level), 
and [2] assessing the importance of interactions 
within and among levels of biological organization 
(species, individual organisms, genes). 

How to bring together macro-ecological theories 
that assume no significant interactions among 
species (e.g. [30,31].) with those that are explicitly 
based on interactions (e.g [32].) could provide an 
overarching challenge for the GOs Network to ad- 
dress. In any case, ecological network theory is 
likely to be very important for the GOs Network, 
which should represent a powerful source of data 
on: 

•Consumer-resource body size ratios 
•Metabolite/Resource use networks 
•Distributions of connectivity 
•Interaction strengths 

Intensive study at GOs sites and use of genomic 
approaches is also likely to provide a wealth of 
valuable trait information, such as: 

•Trait distributions of species in an area (e.g. bi- 
omass) 

•Trait distributions of individuals in a species or 
community (e.g. body size) 

•Distributions of community-level functional 
traits (e.g., through metagenomics) 

•Species range size distributions 

Given the likely evolution of the GOs Network and 
the types of data it will generate, the following 
questions should be developed at future GOs Net- 
work meetings: 



i) Does greater diversity imply greater func- 
tionality and/or resilience? Can GOs Network 
address this (semi) quantitatively? 

ii) What kind of functions are we talking about? 
Are they readily measurable? How many spe- 
cies are present/are missing? The GOs Net- 
work could provide (more) solid data points 
from which one could extrapolate (if a coher- 
ent sampling design is in place). 

iii) What are the marginal losses/impacts of 
losing a given instance (gene, species, func- 
tion) of an ecosystem - what is the marginal 
value of incremental increase/loss of biodi- 
versity? 

INFORMATICS: Biocode Commons - 
the Informatics Stack for Genomic 
Observatories 

John Deck (University of California Berkeley, USA) 
and Steven Stones-Havas (Biomatters Ltd., New 
Zealand) chaired the session, with panel members 
Renzo Kottmann (Max Planck Institute Bremen, 
Germany), Norman Morrison (University of Man- 
chester, UK), Philippe Rocca-Serra (University of 
Oxford, UK), Mesude Bicak (NERC Centre for Ecol- 
ogy and Hydrology Wallingford, UK). 

Introduction: The Biocode Commons aims to co- 
ordinate the interoperable and standards based 
"informatics stack" for the global network of Ge- 
nomic Observatories. The GOs Network user re- 
quirements will provide the overarching strategic 
direction for the Biocode Commons and will serve 
as its major user community (although the scope 
of the Biocode Commons could expand in the fu- 
ture). The space between standards and software, 
standards and other standards, and between 
software and other software, is what we are call- 
ing the "commons" here and "Biocode Commons" 
specifically addresses the gap present in the bio- 
diversity genomics community between collec- 
tions-based systems (commonly museum speci- 
mens and tissues) and sequence-based systems 
(DNA and increasingly a range of other 'omics data 
types). 

Goals, Findings, and Recommendations: The 

goals for this session were to define the data 
types, data formats, software, standards, and gov- 
ernance structure for the Biocode Commons. The 
categories of data the group outlined as being rel- 
evant for the GOs Network included: meta, pheno- 
typic, environmental, taxonomic, genomic, geo- 
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graphic, and sample. The group defined a list of 
many different formats for sharing data, including 
BIOM, ISA-tab, GCDML, ABCDDNA, INSDC struc- 
tured formats, Darwin Core archives, to name but 
a few. Software was discussed and the group out- 
lined several principles that software should ad- 
here to including versioning, and creating both 
human and machine interfaces. Finally, the group 
suggested compiling a registry of Genomic Ob- 
servatories (sites) and to enable local repositories 
to construct globally unique identifiers for all 
samples. Discussions are ongoing within the 
Biocode Commons [20]. 

POLICY: Bioarchiving, Access and 
Benefit Sharing 

The first foray into the general area of "policy" for 
the GOs Network dealt with the topic of access and 
benefit sharing (ABS). The session was chaired by 
Jonathan Coddington (Smithsonian Institution, 
USA) with participants Michele Barbier (Mediter- 
ranean Science Commission, France), Katie Barker 
(Smithsonian Institution, USA), John Benzie (Uni- 
versity College Cork, Ireland), Arianna Broggiato 
(Universite Catholique de Louvain, Belgium), Neil 
Davies (University of California Berkeley, French 
Polynesia), Chris Hunter (European Bioinformat- 
ics Institute, UK), Anna Klindworth (Max Planck 
Institute Bremen, Germany), Chris Lyal (Natural 
History Museum, London, UK), Frank Oliver 
Glockner (Max Planck Institute Bremen, Germa- 
ny), Johanna Wesnigk (EMPA Bremen, Germany), 
Antje Wichels (AWI, Germany) 

Introduction: Place-based genomic research re- 
quires sampling and sharing of physical biodiver- 
sity (samples/specimens), often among institu- 
tions and across national borders. These exchang- 
es of biomaterials (and associated digital data and 
metadata) take place over periods of many years. 
This creates challenges that are both tech- 
nical/biochemical (how to handle samples in the 
field in the short term, and how to transport and 
store them over the long term?) and legal (what 
are the terms of use for the biomaterials sampled - 
notably ABS - and the cascade of subsamples, data, 
and other products that flow from them?) 

Goals: This session covered two topics: (i) Captur- 
ing Genomes (Sample handling and preservation 
protocols along the supply chain - from field to re- 
pository - for future 'omics studies), and (ii) Ac- 
cess and Benefit Sharing (ABS) across GOs sites. A 
third relevant topic, Informatics (tracking materi- 



als and data from collection through analyses and 
publication) was already the theme of another 
breakout group (Biocode Commons). 

Findings: Participants agreed that the main two 
topics are distinct and complex and should be the 
subject of separate working groups in the future. 
The participants thought that a third topic of 
"Ecogenomic Technologies" might form another 
future working group addressing how much can 
be automated in situ and how such mechanization 
(including robotics and ecogenomic sensors) 
would help standardization of protocols and 
tracking of metadata. 

Recommendations: GOs should form separate 
working groups: 

i) Access and Benefit Sharing Working Groups 
(ABS WG) 

ii) Capturing Genomes Working Group (CG 
WG), to include initially 'Ecogenomic Tech- 
nologies' 

Actions: Coordinate with the Global Genome Initi- 
ative (GGI, see http://www.mnh.si.edu/ggi/) to 
host a workshop on Capturing Genomes at Smith- 
sonian in conjunction with the upcoming GSC 15 
meeting in April 2013. 

The group then focused its main discussions on 
ABS and became effectively the first meeting of 
the GOs ABS Working Group. 

Deliverables: The session participants recog- 
nized that the detailed text in each ABS agreement 
depends on the provider country that has ultimate 
jurisdiction (under CBD). Consequently, the text of 
ABS agreements is likely to vary somewhat from 
country to country, even if all essential aspects of 
a project are otherwise identical across GOs sites. 
Despite this inevitable complexity, a general ABS 
philosophy for the GOs network is important in 
helping establish trust and in guiding the text of 
specific agreements. Furthermore, template 
agreements for multi-site projects across the GOs 
network can help to facilitate the ABS process by 
providing useful text that will apply at all partici- 
pating GOs. The ABS WG therefore agreed to col- 
laborate on two deliverables: 

i) Code of Conduct for Genetic Resource Access 
and Use in the GOs Network. This non-binding 
Memorandum of Understanding (MoU) would 
harmonize ABS philosophy across museums 
(GOs partners) and field stations, (GOs sites) 
the latter being key trusted partners in pro- 
vider countries, the former being the primary 
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institutions tasked with maintaining ex situ 
collections for the global scientific communi- 
ty. Input for this code of conduct will come 
from various sources, such as the Mediterra- 
nean Code of Conduct being initiated by 
CIESM (The Mediterranean Science Commis- 
sion] 

ii) Ocean Sampling Day (OSD) Use Cases - As 
OSD is the first GOs action, the group will help 
draft the following OSD documents, providing 
specific examples for future projects: 

a A template OSD ABS Agreement. Guided by 
the Code of Conduct, the group will help 
draft a specific 'template' for OSD, initially 
aimed at non-commercial use. A subsequent 
version for commercial use will be prepared 
before December 31, 2012. 

b A template OSD Material Transfer Agree- 
ment for bioarchiving. The group will draft 
an example agreement for archiving OSD 
samples with OSD repositories. The exam- 
ple agreement will function as an agree- 
ment between each participating OSD part- 
ner and the Smithsonian Institution 
(NMNH], an OSD repository. A final version 
will be prepared in conjunction with the 
OSD ABS agreement. 
The first versions of the documents have been 
created as part of the efforts of Micro B3's 
workpackage 8 [33]. 

GOVERNANCE: Building a long-term 
vision for the Genomic Observatories 
Network 

Introduction: This session provided the chance for 
interested GSC 14 participants to consider govern- 
ance models for the GOs Network. Going into the 
meeting, the GOs Network was incubated under the 
GSC as a GSC Project and it was agreed by the group 
that this working relationship is appropriate and 
should remain. Frank Oliver Glockner (Max Planck 
Institute Bremen, Germany) chaired the session, 
which began with a brief introduction by Dawn 
Field (CEH Oxford, UK) on how the GSC is currently 
governed. The GSC has a Board of 20 academics 
who have taken on a leadership role in GSC pro- 
jects. At the time of writing, fifteen projects now 
work under the umbrella of the GSC, including the 
GOs Network, and the Biocode Commons, and new 
project proposals are welcomed. The lead or leads 
of each GSC project are expected to complete a pro- 



ject description that should be updated annually 
and to advance activities through balanced working 
groups. It is expected that larger projects, like the 
GOs Network might eventually have more formal 
governance mechanisms (steering committees, 
etc). All projects working under the GSC umbrella 
are expected to contribute to the overall GSC mis- 
sion, help to define GSC strategy, attend and help 
organize GSC events, and report on progress in a 
regular fashion to the GSC Board and to the broader 
GSC community. Project descriptions have already 
been drafted for the GOs Network, Biocode Com- 
mons and the "Minimum Information about a Ge- 
nomic Observatory" (MIGO) specification and are 
available for comment. 

For the rest of the session, participants then split 
into two groups to discuss the governance of the 
GOs Network as a whole and the Biocode Commons 
in particular (as the informatics arm of the GOs 
Network - see description above). Dawn Field 
(NERC Centre for Ecology and Hydrology Walling- 
ford, UK; University of Oxford, UK) chaired the dis- 
cussion of the GOs Network governance with panel 
members Neil Davies (University of California 
Berkeley, French Polynesia), Jack A. Gilbert (Ar- 
gonne National Laboratory, USA; University of Chi- 
cago, USA), Matthias Obst (University of Gothen- 
burg, Sweden), Chris Meyer (Smithsonian Institu- 
tion, USA), Linda Amaral-Zettler (Marine Biological 
Laboratory, USA). 

The GOs Governance breakout group report fol- 
lows: 

Goals: The first agenda item of the breakout ses- 
sion was to re-affirm, in the light of the previous 
sessions, that there was consensus among those 
present that the GOs Network should go forward. 
This was unanimously agreed upon. The second 
goal was to establish a mechanism of governance, 
no matter how informal, to take forward the Net- 
work post-GSC 14. To date, the GOs Network was 
promoted largely by the six Panel members of the 
session, who recognized that it should eventually 
have strong leadership drawn from many if not all 
represented GOs sites and partners. Key represent- 
atives from many related initiatives, especially 
large networks that may contain multiple GOs sites, 
should also be involved in some capacity in the fu- 
ture. The third goal was to work towards a mission 
statement and the creation of a Founding Charter. 

Recommendations: It was agreed that the GOs 
Network exists 'de facto' as a result of the group- 
authored Nature Correspondence [3] and currently 
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consists of a group of individuals with shared inter- 
ests. Formalizing what is meant by "Genomic Ob- 
servatory" and defining how/if individuals, sites, 
and partners should formally join the network 
would need to be developed in time. Membership 
in the Network was discussed at length and it was 
agreed that this was for the core group to work on 
over the coming months in consultation with the 
wider community. In particular, the group liked the 
GLEON [34] model of "grass-roots governance", in- 
troduced by Trina McMahon (University of Wiscon- 
sin) with membership open to both individuals and 
sites. It was clear that careful consideration is 
needed before labeling a site or institution a Ge- 
nomic Observatory. While some institutions with 
clearly identified sites and observatory programs 
might already 'brand themselves' as GOs (e.g., the 
Moorea Ecostation), it might take longer for others 
to reach the needed institutional approvals. Mod- 
eled on other networks, it was suggested that the 
GOs Network should draft a "Directors Letter" out- 
lining the goals and mission of the Network. GOs 
champions could give this letter to the directors of 
their institutions, perhaps leading to a Memoran- 
dum of Understanding, where an institution would 
effectively join the GOs Network and identify their 
study site/s as GOs. Along these lines, a draft Mis- 
sion Statement was discussed and it was agreed 
that this was a good start and should be the basis 
for further consultation and refinement along with 
a draft Founding Charter: 

Initial draft of a GOs Network Mission 
Statement 

i) To build a global network of premier re- 
search sites working to generate genomic 
biodiversity observations that are well- 
contextualized and compliant with global 
data standards. 

ii) To encourage long-term, place-based, 
DNA-centric programs that quantify biotic 
interactions in an ecosystem and develop 
models of biodiversity to predict the quality 
and distribution of ecosystem services. 

iii) To provide training, technical assistance, 
resources, and best practice guides as a 
learning platform for scientists and organi- 
zations wishing to carry out place-based ge- 
nomic observations, particularly those in de- 



veloping countries with high and/or vulner- 
able biodiversity. 
Findings: The session identified a core set of "GOs 
Champions" to take forward the deliberations and 
continue launching the GOs Network. A group of 
representatives from candidate GOs sites volun- 
teered and this core group was tasked with formu- 
lating the strategy for the Network and preparing 
for GSC 15 (Maryland, USA), GSC 16 (Brisbane, 
Australia) and other future GOs meetings, which 
would serve to broaden the geographic represen- 
tation of GOs. This session provided critical face- 
to-face discussion and the group spent a signifi- 
cant amount of the time exploring shared interests 
and priorities. It was further agreed that in rolling 
out this phased launch the GOs Network would 
aim to build momentum and actively recruit fur- 
ther sites. The currently involved sites are "pillar 
sites" that already have significant genomic stud- 
ies underway, but the GOs Network must evolve to 
be more balanced geographically, geopolitically, 
and scientifically. For example it should include 
more sites outside the US and Europe, more ter- 
restrial sites, etc. The initial focus on "model sys- 
tems" (pillars) yields many benefits including ac- 
celeration of scientific returns when a particular 
site is studied in detail, including the genomic lay- 
er of biodiversity. The session participants con- 
sidered it important that the GOs Network should 
be open to sites that are just being established, as 
well as targeting sites that already have rich scien- 
tific legacies but which are just starting genomic 
research. The group also agreed that in terms of 
priorities, the GOs Network should aim initially at 
sharing experiences, techniques, know-how, best 
practices, and standards to better study each indi- 
vidual site. As the community matures, however, it 
will become a distributed research platform (in- 
strument) able to utilize these shared approaches 
to address joint scientific questions that require 
coordinated study across a diversity of sites. 

Finally, it was recognized that Biodiversity Ge- 
nomics is one of the unifying themes of the GOs 
Network and that this should be clearly articulat- 
ed in future refinements of the Mission Statement 
and Founding Charter for discussion/validation 
with the broader community. 

Actions: A final wrap-up session was chaired by 
Linda Amaral-Zettler (Marine Biological Laborato- 
ry Woods Hole, USA). Priority actions immediately 
following GSC 14 were to write the GSC 14 meet- 
ing report, plan GSC 15 and the second GOs meet- 
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ing (G0s2), review the Draft Founding Charter, 
formalize the concept of membership, take owner- 
ship of GSC project descriptions and support the 
working groups in defining further activities and 
priorities. 

Deliverables: The key deliverables to be generat- 
ed by the core group include a Founding Charter 
and Mission statement and a mechanism for for- 
malizing the Network through definitions of 
membership. 

Formal Close of GSC 14 

At the close, the GSC 14 workshop organizers 
handed over to the GSC 15 workshop organizers. 
The main meeting was adjourned and after lunch 
the MG4U stakeholder workshop and the GSC 
hackathon both started. 

Satellite 1 . GOs Stakeholders workshop. 

The workshop had the title "Potential of Genomics 
Technology for Marine Monitoring and the Marine 
Strategy Framework Directive (MSFD)", and was 
arranged in collaboration with MG4U [6]. The 
meeting was organized by Matthias Obst (Univer- 
sity of Gothenburg, Sweden) and chaired by John 
Benzie (University College Cork, Ireland), and as- 
sembled 31 genomics scientists as well as repre- 
sentatives of marine policy bodies. The program 
featured short scientific talks, breakout sessions 
and panel discussions [35]. 

Introduction: The Marine Strategy Framework 
Directive (MSFD) is a legal framework that de- 
mands a repertoire of knowledge from the Euro- 
pean member states centered around the de- 
scriptors of 'good environmental status' (GES). 
The descriptors, together with associated 'criteria' 
and 'indicators' will be used to decide on the sta- 
tus of marine ecosystems, and how GES can be 
achieved and maintained in the future. On this ba- 
sis, the member states have to provide the 'initial 
assessment' (of current status) and national de- 
termination of GES for their waters, including the 
nationally relevant targets and indicators. The 
member states are currently in the process of re- 
porting these marine indicators to the Commis- 
sion, and these will be the basis for their national 
marine monitoring programs from 2014 onwards. 
Currently there is a strong focus on three issues, 
(i) data, (ii) methods to turn data into knowledge, 
and (iii) boundaries between 'good' and 'no good' 
status. There are still significant knowledge gaps 
in the understanding of marine ecosystems, espe- 
cially when following an ecosystem-based ap- 



proach. Furthermore, in many cases important 
baseline knowledge that is necessary to define the 
GES of European marine waters is missing. Cur- 
rently there are no genomic (many genes across 
genome) and few genetic (single or small number 
of genes or markers) methods considered for con- 
tribution to the MSFD indicators. 

Goals: The goal of the workshop was to discuss 
how genetic information and analytical methods 
can contribute to cost-efficient monitoring of ma- 
rine ecosystems. The workshop was used in par- 
ticular to identify genomic methods that can con- 
tribute knowledge to the descriptors of the MSFD, 
and to discuss how genomic methods can be inte- 
grated into marine monitoring programs. 

Findings: Genomic methods have a high potential 
to provide ecosystem-based knowledge in multi- 
ple areas of marine monitoring, such as providing 
a consistent and high quality standard of species 
level identification, generating biodiversity esti- 
mates that include all branches of life, providing 
estimates of ecosystem function at all biological 
levels, discovering ecosystem wide trophic inter- 
actions, and estimating genetic diversity at the 
population level. The group concluded that ge- 
nomic methods with high potential fall into four 
categories: 

i) Methods that generate the same knowledge 
faster, cheaper, and/or better compared to 
conventional methods (e.g., species identifi- 
cation using marker genes such as DNA 
barcoding); qPCR for water quality assays; 
microarrays for detection of Harmful Algal 
Blooms - HABs) 

ii) Methods that allow us to do things we 
could not do before (i.e., give us new types 
of data and knowledge, such as 
metagenomics to study the biodiversity and 
function of whole ecosystems) 

iii) Methods that allow us to go from patterns 
to processes and unravel causalities (e.g., 
transcriptional response of species to chemi- 
cal exposure) 

iv) Methods that have no alternative aside 
from molecular methods (e.g., SNPs for 
tracing populations of species and DNA bar- 
coding for analyzing food webs) 

It was agreed that genomic methods have a high 
potential to address many descriptors in a stand- 
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ardized way. Also, the fact that there are currently 
no genetic or genomic approaches in the monitor- 
ing programs will make it relatively straightfor- 
ward to introduce the standards developed by the 
GSC into Marine Monitoring. 

The methods that meet the above criteria, are rou- 
tinely established, and have existing pilots are (i) 
qPCR, (ii) DNA barcoding using marker genes, (iii) 
SNPs, and (iv) microarrays. These methods are 
cost-efficient and able to add significant 
knowledge to descriptors, including: Dl (Biologi- 
cal diversity), D2 (Non-indigenous species), D3 
(Populations of fish and shellfish), D4 (Food 
webs), and D5 (Eutrophication), and D6 (Seafloor 
integrity). There are more methods with high po- 
tential in development, such as microbial 
metagenomics and transcription analysis, among 
others. 

Recommendations: The entry point for these 
methods into regular monitoring programs should 
be at the national level, and so genomic scientists 
should partner with national institutes that cur- 
rently implement the MSFD indicators. At the 
same time, a strong network should be developed 
in order to communicate the benefit of genomic 
tools to national environmental agencies, and to 
design pilot programs on the national and regional 
level. The network should include programs like 
the COST action EMBOS, the Micro B3 action 
Ocean Sampling Day, the FP7 project DEVOTES, 
the Genomic Observatories Network, the Ge- 
nomics Standard Consortium, the EMBRC infra- 
structure, FP7 Project STAGES, and European ma- 
rine GEO-BON initiatives. 

Action: As a follow up to the stakeholder work- 
shop, MG4U will circulate and assemble a white 
paper entitled 'Genomics in marine monitoring: 
new opportunities for assessing marine health 
status', including guidelines for the integration 
process of genomic methods into ongoing moni- 
toring programs. This roadmap will contain a 
summary of methods that have the potential to 
contribute to or generate new knowledge mapped 
against MSFD indicators, and a cost-benefit com- 
parison (based on existing case studies that use 
these methods), as well as a discussion of analyti- 
cal approaches. In addition, the white paper will 
provide an overview of infrastructures needed for 
genomic monitoring (e.g., GOs Network) and 
stakeholders involved, such as national environ- 
mental agencies working with the MSFD, and also 
Regional Seas Conventions and the EU commis- 



sion. The paper will be used as a guiding docu- 
ment for communication with organizations in- 
volved in monitoring. 

Deliverables: MG4U and GSC will deliver the 
white paper 'Genomic Methods in Marine Monitor- 
ing and the Marine Strategy Framework Directive 
(MSFD)'. 

As a follow up to the stakeholder workshop, MG4U 
will circulate and assemble a white paper entitled 
'Genomic Methods in Marine Monitoring and the 
Marine Strategy Framework Directive (MSFD)', 
with guidelines for the integration process of ge- 
nomic methods into ongoing monitoring pro- 
grams. Genomic methods have a high potential to 
provide ecosystem-based knowledge in multiple 
areas of marine monitoring, such as providing a 
consistent and high quality standard of species 
level identification, generating biodiversity esti- 
mates that include all branches of life, providing 
estimates of ecosystem function at all biological 
levels, discovering ecosystem wide trophic inter- 
actions and estimating genetic diversity at the 
population level. The methods identified, such as 
barcoding, amplicon sequencing, microarrays, 
quantitative real-time polymerase chain reaction 
(qRT-PCR), short nucleotide polymorphisms 
(SNP's), transcriptomics and metagenomics can 
contribute to developing 'next generation', inte- 
grated, monitoring approaches. 

The roadmap will include a summary of methods 
that have the potential to contribute to or gener- 
ate new knowledge mapped against MSFD indica- 
tors, and a cost-benefit comparison (based on ex- 
isting case studies that use these methods), as well 
as a discussion of analytical approaches. In addi- 
tion, the white paper will provide an overview of 
infrastructures needed for genomic monitoring 
[e.g., GOs Network) and stakeholders involved, 
such as national environmental agencies working 
with the MSFD, and also Regional Seas Conven- 
tions and the EU commission. 

This paper will be used as a 'living document' for 
communication with organizations involved in 
monitoring. 

Deliverables: MG4U and GSC will deliver the 
guideline paper 'Genomic Methods in Marine Mon- 
itoring and the Marine Strategy Framework Di- 
rective (MSFD)' with a roadmap for the integra- 
tion of genomic methods in marine monitoring 
programs. 
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GSC Biocode Commons Hackathon 

The GSC has previously hosted two small 
hackathons offering a chance for technical work to 
take place on specific topics. This was the first GSC 
meeting to include a hackathon. Two topics were 
taken forward. John Deck (University of California 
Berkeley], USA led a group on "Building a Refer- 
ence Model for Bio-Collections" and Renzo 
Kottman (Max Planck Institute Bremen, Germany] 
led a group on Field Information Management Sys- 
tems (FIMS] using OSD and the Euromarine pro- 
ject as Use Cases. 

Participants in the first hackathon included John 
Deck (University of California Berkely, USA], Nor- 
man Morrison (University of Manchester, UK], 
Sujeevan Ratnasingham (University of Guelph, 
Canada], Phillipe Rocca-Serra (University of Ox- 
ford, UK], Barry Smith (University of Buffalo, 
USA], Ramona Walls (New York Botanical Garden, 
USA], Trish Whetzel (Stanford University, USA], 
John Wieczorek (University of California Berkeley, 
USA], Jai Rideout (Northern Arizona University, 
USA], Matthew Bietz (University of Washington, 
USA], Steven Stones-Havas (Biomatters Ltd., New 
Zealand], Gabriele Droge (Free University of Ber- 
lin, Germany], Eamonn O'Tuama (Global Biodiver- 
sity Information Facility, Denmark], Peter Sterk 
(University of Oxford, UK], Robert Robbins (Uni- 
versity of California San Diego], Pier Buttigieg 
(AWI, Jacobs University Bremen, Germany], Reed 
Beaman (University of Florida, USA], and Michelle 
Koo (University of California Berkeley, USA]. Par- 
ticipants in the second hackathon: Juliane Casquet 
(CNRS & University of Toulouse, France], Julia 
Schnetzer (Max Planck Institute Bremen, Germa- 
ny], Steve Stones-Havas, John Deck, Jai Rideout, 
Matthew Bietz, Robert Robbins, Norman Morrison 
(University of Manchester, UK]. Eamonn O'Tuama, 
Peter Sterk, Philippe Rocca-Serra, Pier Luigi 
Buttigieg, Evangelos Pafilis (HCMR Crete, Greece], 
Giorgos Kotoulas (HCMR Crete, Greece], Linda 
Amaral-Zettler (Marine Biological Laboratory, 
Woods Hole, USA], Neil Davies (University of Cali- 
fornia Berkeley, French Polynesia], Gabi Droge, 
Chris Meyer (Smithsonian Institution, USA], 
Mesude Bicak (NERC Centre for Ecology and Hy- 
drology Wallingford, UK], Inigo San Gil (University 
of New Mexico / LTER, USA], Pelin Yilmaz (Max 
Planck Institute Bremen, Germany], Anna 
Klindworth (Max Planck Institute Bremen, Ger- 
many], and Ramona Walls. 



Building a Reference Model for Bio- 
Collections Hackathon 

The first hackathon built a reference model for 
bioCollections, using Protege [36] to model this 
within the basic formal ontology framework [37] 
with the resulting product posted to the Biocode 
Commons Google Code site [38]. Primary contri- 
butions to this ontology included definitions for 
types of sampling processes, including, for in- 
stance, identification process, observing process, 
sub-sampling process (a complete list of processes 
can be found by examining the ontology file post- 
ed on the Google Code site. In addition to the on- 
tology output we proposed expanding the Darwin 
Core "Basis of Record" to include DNAExtract and 
Subsampled Tissue in order to align Darwin Core 
with what we discussed. Several further actions 
were identified that could be addressed at future 
events. 

Field Information Management Systems 
Hackathon: 

The second hackathon dealt with "Methods linking 
data at the point of sampling & collection" as part 
of EuroMarine [39]. The focus was on advancing 
technological developments to consistently track 
samples and contextual data to link multidiscipli- 
nary data from the point of collection. The two 
goals were to (i] track samples using online data- 
bases synchronized with offline field/lab applica- 
tions (spreadsheets] that are compatible with data 
collection across the fields encompassed in 
EuroMarine and (ii] enable linking multidiscipli- 
nary data from the point of collection using for ex- 
ample geo-references and supporting structured 
storage and electronic exchange, thus reducing the 
need for retrospective text and data mining. This 
hackathon was a follow up of the EuroMarine Task 
4.2 expert workshop held on 29 th September 2011 
at the Max Planck Institute for Marine Microbiolo- 
gy in Bremen, Germany. All participants using or 
developing field information management sys- 
tems (FIMS] introduced their tools and approach- 
es and detailed on the efforts to develop and main- 
tain the tools. This laid the foundation to create a 
concept for a mobile device (a smartphone appli- 
cation] able to support the data acquisition and 
distribution. It was decided to create such an app 
first for Android phones and later for IPhones and 
use the upcoming pilot Ocean Sampling Days [29] 
of the Micro B3 Project [7] as use cases. The com- 
plete protocol of the Hackathon is available at 
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http://www.euromarineconsortium.eu/download 

s/category/17- 

sam- 

pling%20and%20data%20collection?download=l 
56:euromarine-t4-2-report-05-ll-2012-final . 

Conclusions 

In conclusion, this was one of the largest GSC 
meetings to date and served to launch the GOs 
Network (GOsl meeting). There was strong sup- 
port from a range of candidate genomic observa- 
tories (Table 1) and GOs Partners (Table 2). There 
was discussion of potential GOs Actions and Ocean 
Sampling Day is building momentum. This meet- 
ing identified a GOs core group of champions, with 
an overarching focus on biodiversity genomics 
and at least one working group in four key areas: 
science, informatics, technologies, and policy. The 
GOs core group will work towards a mission 
statement, a founding charter, defining member- 
ship, and recruitment of further GOs and Partners. 
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